TC-STAR: New language resources for ASR and SLT purposes
Authors
Abstract
In TC-STAR a variety of Language Resources (LR) is being produced. In this contribution we address the resources that have been created for Automatic Speech Recognition (ASR) and Spoken Language Translation (SLT). So far there are 14 LR in total: two training SLR for ASR (English and Spanish), three development LR and three evaluation LR for ASR (English, Spanish, Mandarin), and three development LR and three evaluation LR for SLT (English-Spanish, Spanish-English, Mandarin-English). In this paper we describe the properties, validation, and availability of these resources.
Similar resources
Validation of language resources in TC-STAR
In TC-STAR a variety of Language Resources (LR) are being produced. In this contribution we address the validation of resources that were created and used for the second Evaluation Campaign of the project. For the three types of topics covered by the project (ASR, SLT, TTS) the validation of both development and evaluation sets is described. For each type we successively address the description...
ICT System Description for the 2006 TC-STAR Run #2 SLT Evaluation
This paper describes the systems with which the Institute of Computing Technology, Chinese Academy of Sciences, participated in the 2006 TC-STAR Run #2 SLT Evaluation. We developed three systems based on different techniques: Confucius, a phrase-based system; Lynx, based on tree-to-string alignment templates; and Bruin, based on BTG (Bracketing Transduction Grammar). These three systems share the same phr...
End-to-End Evaluation of a Speech-to-Speech Translation System in TC-STAR
The paper describes a methodology for evaluating speech-to-speech translation systems, together with the results obtained. The evaluation scheme uses questionnaires filled in by human judges to assess the adequacy and fluency of audio translation outputs, and was applied in the second TC-STAR evaluation campaign. The same evaluation methodology is carried out both on the outputs of an automatic syst...
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR: Results from the first evaluation campaign
This paper reports on the evaluation activities conducted in the first year of the TC-STAR project. The TC-STAR project, financed by the European Commission within the Sixth Framework Program, is envisaged as a long-term effort to advance research in the core technologies of Speech-to-Speech Translation (SST). SST technology is a combination of Automatic Speech Recognition (ASR), Spoken Languag...
Pseudo-morpheme and Confusion Network Based Korean-English Statistical Spoken Language Translation System
In this demonstration, we present POSSLT (POSTECH Spoken Language Translation), a Korean-English statistical spoken language translation (SLT) system that uses pseudo-morpheme and confusion network (CN) based techniques. As in most other SLT systems, automatic speech recognition (ASR) and machine translation (MT) are coupled in a cascading manner in our SLT system. We used confusion network based ...